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ABSTRACT 

Resistance to drugs targeting liuman thymidylate 
syntliase (TS) poses a major cliallenge in the field 
of anti-cancer therapeutics. Overexpression of the 
TS protein has been implicated as one of the factors 
leading to the development of resistance. Therefore, 
repressing translation by targeting the TS mRNA 
could help to overcome this problem. In this study, 
we report that the compound Hoechst 33258 (HT) 
can reduce cellular TS protein levels without 
altering TS mRNA levels, suggesting that it modu- 
lates TS expression at the translation level. We have 
combined nuclear magnetic resonance, UV-visible 
and fluorescence spectroscopy methods with 
docking and molecular dynamics simulations to 
study the interaction of HT with a region in the TS 
mRNA. The interaction predominantly involves inter- 
calation of HT at a CC mismatch in the region near 
the translational initiation site. Our results support 
the use of HT-like compounds to guide the design of 
therapeutic agents targeting TS mRNA. 

INTRODUCTION 

Thymidylate synthase (TS) catalyzes the reductive niethy- 
lation of 2'-deoxyuridiiie-5'-moiiophosphate (dUMP) to 



2'-deoxythymidiiie-5'-monophosphate (dTMP). It plays a 
key role in the biosynthetic pathway that, in human cells, 
provides the sole de novo source of thymidylate. As 
thymidylate is an essential precursor required for DNA 
replication and repair (1), TS is an attractive anti-cancer 
target. Drugs inhibiting the TS protein lead to its over- 
expression and consequent development of drug resist- 
ance. The effect has been attributed, at least in part, to 
the interruption of the autoregulatory feedback loop 
caused by disruption of the TS protein-TS mRNA inter- 
action (2). Therefore, repression of TS translation by 
targeting the TS mRNA could provide an effective 
means of overcoming the development of resistance. The 
TS-mRNA sequence spanning nucleotides 75-110, 
referred to as Site I, encompasses the start codon and 
has been shown to be relevant for the TS mRNA- 
protein interaction. It is predicted to have a stable 
stem-loop structure with a CC mismatch in the stem 
(Figure lA) (3). Pubhshed literature on the interactions 
of ligands, such as aminoglycosides and HOECHST 
33258 (HT) (Figure IC), with Site I-like RNA constructs 
(4-6) suggests that these bind at or in the vicinity of the 
CC mismatch. Therefore, Site I provides a functionally 
relevant structural motif that may be targeted to develop 
novel anti-cancer drugs. 

HT is a fluorescent dye widely known to bind at the 
minor groove of AT-rich B-DNA (7). The HT-DNA 
complex is suggested to be stabilized by hydrogen 
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Figure 1. (A) Depiction of TSmRNA witli the predicted stem-loop sec- 
ondary structure of Site I (nucleotides 75-110) containing the start 
codon. (B) Predicted secondary structures of the RNA constructs, 
TSMC, TSGC and TSl. The sequence and structural elements of Site 
I preserved in the RNA constructs are shown in black, whereas the rest 
of the RNA construct is gray. (C) 2D structure of Hoechst 33258 (HT) 
with the atom, ring and torsion angle nomenclature used in the text. 



bonding and van der Waals interactions with the deep 
minor groove of B-form DNA hehces. However in GC- 
rich DNA regions, partial intercalation has been observed 
(8-10). Based on electric linear dichroism studies, HT has 
also been proposed to intercalate at the pyrimidine bulge 
in the TAR RNA of HIV-1, although the authors ques- 
tioned the structural viabihty of intercalation due to the 
steric hindrance of the bulky methylpiperazine and 
hydroxyl phenyl groups on either side of the bis- 
benzimidazole fragment of HT (11). HT is readily taken 
into cells and is known to have moderate anti-leukemic 
and anti-helminthic activity, although the mechanism of 



cytotoxicity is not well understood (12). HT has been 
observed to bind specifically to a TS RNA Site 1 construct 
with a dissociation constant of 60 ± 13 nM; the binding is 
facihtated by the presence of a CC misinatch and competi- 
tive with binding of the aminoglycoside, paromomycin (4). 
Mutation and RNase footprinting experiments indicated 
that the specific binding of HT required non-duplex 
RNA, was favored by the presence of GC base pairs 
adjacent to the mismatch but not sensitive to the base 
type at the bubble (4). 

To investigate the biological relevance of the interaction 
of HT with the TS mRNA, we performed cell-based assays 
and monitored the effect of HT on the levels of TS mRNA 
and protein. Surprisingly, we observed that HT reduced 
the TS protein levels by acting at the level of translational 
regulation, raising the possibility that HT might directly 
interact with the TS niRNA in the cell. To exploit HT as a 
lead compound for the design of anti-cancer agents tar- 
geting the TS mRNA, a detailed structural characteriza- 
tion of HT-mRNA binding is desirable. Since the CC- 
mediated HT binding site on TS mRNA (4) is distinct 
from the HT binding site observed for the TAR RNA 
(11), a direct deduction of the binding mode from that 
for TAR RNA is not feasible. We therefore studied the 
molecular details of HT-TS mRNA interactions using 
nuclear magnetic resonance (NMR), UV-Vis and fluores- 
cence spectroscopy techniques, complemented with com- 
putational docking and molecular simulations. For this 
purpose, we analyzed three RNA constructs: TSl, 
TSMC and TSGC (Figure IB). TSl has the native pre- 
dicted stein loop structure of Site I, stabilized by two add- 
itional GC base pairs; its interaction with HT was 
reported by Cho et al. (4). TSMC is a shorter construct 
that has the same three base pairs as Site I flanking the CC 
mismatch in both directions; its interaction with 
paromomycin has previously been studied by NMR (6). 
The CC mismatch of TSMC has been replaced by a GC 
base pair in TSGC. Our data show that HT binds the Site 
I-like RNA constructs in an ensemble of modes with inter- 
calation at the site of the CC bubble being the dominant 
binding mode. 



MATERIALS AND METHODS 

Materials 

RNA oligonucleotides 

TSMC 5'-(r(GGC CCG CCG AAA GGC CGG CC))-3', 
TSGC 5'-(r(GGC CGG CCG AAA GGC CGG CC))-3' 
and TSl 5'-(r(GGG CCC GCC GCG CCA UGC CUG 
UGG CCG GCC C))-3' were purchased from Biospring 
GmbH, Frankfurt, Gennany. The RNA was extensively 
dialyzed in 500 Da dialysis membrane tubes against pro- 
gressively decreasing NaCl concentrations (1-0 M) and 
subsequently lyophihzed. Just before measurements, the 
samples were dissolved and subjected to heating at 95° C 
for 5 min followed by snap coohng on ice with the aim of 
trapping the kinetically favored intramolecular mono- 
meric hairpin conformation rather than an intermolecular 
dimer. The stoichiometry of the samples was checked with 
native PAGE. 
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DNA oligonucleotides 

AT-DNA (5'-AAG GCT ACG ATC CGC ATC CGA 
AAA TTA AAG CGC CGG TGG CTA T-B'/S'-TTC 
CGA TGC TAG GCG TAG GCT TTT AAT TTC 
GCG GCC ACC GAT A-3') and GC-DNA (5'-GCG 
GCC GC-3') were purchased from Eurofins MWG 
Operon, Ebersberg, Germany. Just before measurements, 
the samples were dissolved and subjected to heating at 
95°C for 5 min followed by slow cooling to room tempera- 
ture to favor the formation of a duplex. 

HT 

HT was purchased from Sigma Aldrich (ID B1155). 
Cell line 

The 2008 cell line was estabhshed from a patient with 
serous cystadenocarcinoma of the ovary. The cells were 
grown as monolayers in RPMl 1640 medium containing 
10% heat-inactivated fetal bovine serum and 50 |ig/ml 
gentamycin sulfate. All cell media and serum were 
purchased from Lonza, Verviers, Belgium. Cultures were 
equilibrated with humidified 5% CO2 in air at 37°C. All 
studies were performed in Mycoplasma negative cells, as 
routinely determined with the MycoAlert Mycoplasma de- 
tection kit (Lonza, Walkersville, MD, USA). 

Methods 
Biological assays 

Drug treatment. The cells were administered with 
increasing concentrations of HT (0, 0.1, 1, 2.5, 5, 7.5, 10 
and 20 |.iM) and incubated for 48 and 72 h. In addition, the 
compound Nl,N12,bis(ethyl)spermine (BESpni) was ad- 
ministered to the cells and incubated for a period of 72 h 
as a control. BESpm is a bis(ethyl) analog of the naturally 
occurring polyamine spermine. It has been shown to 
downregulate the key enzymes for polyamine biosynthesis, 
ornithine decarboxylase and S-adenosylmethionine de- 
carboxylase, and to induce the activity of the rate-limiting 
enzyme in polyamine catabohsni, spermidine/spermine 
A^'-acetyltransferase, resulting in depletion of polyamine 
pools and inhibition of ceU growth at micromolar concen- 
trations in several tumor models (13). Since, no effect of 
BESpm alone on TS expression has been reported, the 
compound was chosen as a control molecule to confirm 
the specific activity of HT. 

Western blotting. Cells were harvested, washed twice in 
ice-cold Ix PBS, resuspended in 20 mM Tris-HCl (pH 
7.4), 150mM NaCl, 1 mM EDTA (pH 8.0), 1% Triton 
X-100 and 0.1% SDS and lyzed by freeze-thawing three 
times followed by sonication using three 2—3 s bursts. The 
insoluble debris was removed by centrifugation at 
1 5 000 X g for 30 min. Protein concentrations were 
determined using the method of Lowry (14). In total, 
25 |.ig of each protein sample was resolved by SDS- 
PAGE (12%). The gels were electroblotted onto hydropho- 
bic polyvinyhdene difluoride membranes (HybondTM-P 
PVDF, GE Healthcare Bio-Science, Uppsala, Sweden). 
Antibody staining was performed with a chemilumines- 
cence detection system (ECL Plus Western Blotting 
Detection Reagent, GE Healthcare Bio-Science), using a 



1:500 dilution of the anti-human TS mouse TS106 mono- 
clonal primary antibody (Invitrogen S.r.L., Milan, Italy), 
1:10000 of anti-human vincuhn mouse antibody for nor- 
malization (Sigma-Aldrich S.r.L., Milan, Italy) and 1:3000 
dilution of a horseradish peroxidase-conjugated sheep anti- 
mouse secondary antibody (GE Healthcare Bio-Science). 
Quantitation of signal intensity was performed by densi- 
tometry on a GS-800 cahbrated densitometer (Bio-Rad, 
CA, USA) and analyzed using Quantity One software 
(Bio-Rad). The protein expression of the housekeeping 
gene vinculin was used as a control to normalize the 
measured protein levels. 

Real-time reverse transcription-PC R analysis. Total RNA 
was extracted from the cultured cells using TRI reagent 
(Sigma-Aldrich S.r.L.). Reverse transcription was per- 
formed with 2|ig of total RNA using random primers 
(Promega, Milan, Italy) and M-MLV reverse transcriptase 
(Promega). Real time RT-PCR was performed with 10 ng 
of cDNA using Power SYBR® Green PCR Master Mix 
[Applied Biosystems, Monza (MI), Italy] and an ABI 
PRISM 7900 HT Sequence Detection System (Applied 
Biosystems), followed by a dissociation curve analysis 
and subsequent agarose gel electrophoresis to confirm 
amplification. The following TS primers (Genbank: 
NM_001071.1) were used, forward: 5'-CAG ATT ATT 
CAG GAC AGG GAG TT-3' and reverse: 5'-CAT 
CAG AGG AAG ATC TCT TGG ATT-3'. The amount 
of target, normalized to an endogenous reference 
[glyceraldehyde-3-phosphate dehydrogenase (GAPDH)] 
and relative to a calibrator (untreated sample), was 
given by 2"^^^*"' calculation (15). All experiments were 
carried out three times in triplicate; amplification plots 
were analyzed using the ABI Prism 7900 HT SDS 
version 2.1 software (Apphed Biosystems). 

NMR spectroscopy 

RNA samples were measured at 100 nM concentration, in 
90% H2O+10% -H2O or 100% -H2O, as required and 
adjusted to pH 6.4 with dilute HCl/NaOH. HT was 
dissolved at lOmM concentration in the same solvent as 
the RNA, and the pH was adjusted to 6.4. 

RNA chemical shift changes upon HT addition were 
monitored using homonuclear 2D Total Correlation 
Spectroscopy (TOCSY) (16,17) (t,„ = 80 ms) and 2D 
Nuclear Overhauser Effect Spectroscopy (NOESY) (18) 
(im = 300 and 80 ms) experiments at HT/TSMC ratios 
of 0:1, 0.5:1, 1:1 and 1.5:1. Significant precipitation at 
the highest ratio precluded the use of further titration 
points. AU experiments were performed on a 900-MHz 
Bruker spectrometer with a cryogenically cooled probe. 
Measurements were done at 293 K which was found to 
be the optimal temperature for both water and ^H20 
measurements. The chemical shift assignments for the 
free TSMC reported by Tavares et al. (6) were used. 

UV- Vis, fluorescence spectroscopy 

AU samples were prepared in 20niM phosphate buffer 
solution at pH 7.5. Absorption spectra were measured 
with a Varian Caryl 00 dual beam spectrophotometer. 
Fluorescence spectra were measured with a FluoroMax2 
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Spex Jobin-Yvon spectrofluorometer. Emission spectra 
were corrected for the spectral sensitivity of the emission 
channel; excitation spectra were normalized relative to the 
spectral intensity distribution of the xenon lamp; the cor- 
rectness of the procedure was checked in the 300-390 nm 
region by comparing the absorption and excitation spectra 
of anthracene in «-hexane. All measurements were per- 
formed in 0.4 X 1 cm^ cuvettes at 19-21°C. 

Time-correlated single photon counting experiments 
were performed with an IBH System 5000 equipment 
having a time resolution around 0.25 ns after deconvolu- 
tion. The fluorescence decay functions were deconvoluted 
from the measured time profiles using a combined hnear 
and non-linear iterative reconvolution method based on 
minimization of the chi-square parameter. 

Computational modeling 

Preparation and parameterization of the starting 
structures. Model 6 from the solution structure ensemble 
of TSMC reported by Tavares et al. (PDB ID 2RPT) (6) 
and HT coordinates from solvation dynamics studies 
kindly supplied by Dr S.A. CorcelH (19) were used for 
modehng the complex of TSMC with HT. HT was 
manually docked (visualization was done in Pymol 1.2r2) 
in the major groove at the crevice formed by the CC 
mismatch. Since the orientation of HT with respect to 
RNA was not known, two docking poses rotated 180° 
with respect to each other were generated (Supplementary 
Figure SI). The ParnibscO force field (20) was used to par- 
ameterize the RNA, and the published parameters (19) 
were used for HT. 

Molecular dynamics simulations. All molecular dynamics 
(MDs) simulations were performed under periodic 
boundary conditions using the SANDER module of 
AMBERIO (21). Water molecules were modeled with the 
T1P3P (22) potential, and the system was neutralized with 
Na"^ ions. Electrostatic interactions were computed using 
the particle mesh Ewald method for which the direct sum 
cutoff was set to 9 A. Non-bonded interactions were also 
cutoff at 9 A. Bonds involving hydrogen atoms were con- 
strained to their equilibrium lengths using the SHAKE (23) 
algorithm. Newton's equations of motion were integrated 
every 2 fs. A variant of a previously reported protocol (24) 
was used to minimize, heat and equilibrate the systems 
(Supplementary Figure S2). All simulations were per- 
formed in the NPT ensemble. A constant temperature of 
300 K was maintained by weak coupling to an external 
bath (25) with a time constant of 5ps. The pressure was 
maintained at 1 atm by isotropic position scaling. 

A two-step procedure was used to simulate the complex. 
In the first step (referred to as A 1 and Bl, for the docking 
poses in Supplementary Figure SI A and B, respectively), 
the fully flexible HT was aUowed to explore the surface of 
a restrained TSMC structure and find an optimal docking 
pose. This was accomplished by harmonically restraining 
the atoms of TSMC with a force constant of 25 kcal/mol/ 
A^, while aUowing HT and the solvent atoms to move 
freely during a production run of 10 ns. The final frame 
from the Al step was taken as the initial frame for the 
second step (referred to as A2) in which induced fit of the 



HT-TSMC interaction at the CC mismatch was explored. 
This was accompHshed by aUowing unrestrained 
movement of HT, solvent and residues 4-6 and 16-18 of 
TSMC, i.e. of the CC mismatch and one flanking base pair 
above and below the mismatch, while the rest of the RNA 
was harmonically restrained with a force constant of 
lOkcal/mol/A^, during a production run of 24.6 ns. The 
conformations sampled during the first 5 ns and the last 
500 ps of the A2 run were independently averaged and 
then minimized using steepest descent and conjugate 
gradient methods to obtain representative models for 
HT-TSMC groove binding (Figure 2A) and HT-TSMC 
intercalation (Figure 2B) complexes, respectively. 

To estimate the binding free energy for the RNA-hgand 
interaction, continuum solvent calculations were per- 
formed using the internal Poisson-Boltzmann Surface 
Area (PBSA) solver in Sander in the Amber9 package. 
To compute binding free energies for the groove bound 
mode, the intercalation mode and the intermediate transi- 
tion phase, the trajectory from the A2 run was split into 
segments of 5 ns each (the last segment being 4.6 ns) which 
were then used independently for the calculations. The 
structures of unbound TSMC and HT were taken from 
the simulated complexes, i.e. changes in the conformation 
of each of the components were neglected for the binding 
free energy calculations. 

For reference, the unbound forms of TSMC and of HT 
were simulated for 9 and 2 ns, respectively, using the same 
settings as for the A2 run. 

RESULTS AND DISCUSSION 

HT-induced translational repression of TS mRNA 

Cell-based assays were conducted to monitor the TS 
protein and mRNA levels at different administered doses 
of HT in an ovarian cancer cell fine. The TS mRNA levels 
were essentially unaltered after HT exposure, whereas the 
TS protein levels were progressively reduced to 25 and 
17% of the control on administration of increasing 
doses of HT for 48 and 72 h, respectively (Figure 3), sug- 
gesting that HT affects intracellular TS protein levels by 
translational regulation. This observation is unanticipated 
considering that HT is a strong DNA binding compound 
and therefore expected to preferentially modulate tran- 
scription. In addition, since the expression of the 
vincuhn protein remained unaltered, and the control 
compound BESpm did not reduce TS expression as 
much as HT (Supplementary Figure S3), the observed 
effect of HT on TS shows specificity. It has already been 
shown that HT can bind at the stem of Site I-hke RNAs 
(4,6) and we thus hypothesize that direct interaction with 
TS mRNA could be a possible mechanism for the HT- 
mediated repression of TS translation. To obtain 
detailed structural insights into the HT-Site I interaction, 
we next pursued experiments using the TSMC construct as 
a model for the stem of the Site I RNA (Figure lA and B). 

CC mismatch in RNA is the binding site for HT 

The prediction of a stem-loop structure for Site I, with a 
CC mismatch in the stem and the start codon in the loop, 
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Figure 2. Structural models of (A) the groove binding HT-TSMC complex generated by averaging and minimizing the first 5 ns of the simulation A2. 
TSMC is depicted as a cartoon with the major groove facing the reader, whereas HT is shown in stick representation (C: cyan, N: blue, O: red); and 
(B) the HT-TSMC intercalation complex generated by averaging and minimizing the last 500 ps of the simulation A2. 



was first reported by Chu et al. (3). In agreement with the 
prediction, a Site 1-like RNA construct was reported to 
have a high melting temperature of 65° C independent of 
the concentration, suggesting an intra-molecular stem- 
loop fold (26). Another Site I-like RNA construct was 
also observed to adopt a stem-loop fold by solution fluor- 
escence studies, although an asymmetric dimer of two 
hybridized RNA strands was obtained in the crystal 
(27). These reports suggest that the Site I indeed adopts 
a stem-loop structure in solution. Tavares et al. (6) have 
recently reported the NMR structure of a shorter con- 
struct, TSMC, and mapped the binding site for the 
aminoglycoside, paromomycin. In TSMC, the three 
flanking base pairs on either side of the CC mismatch in 
the Site I stem are preserved and the resulting stem 
sequence is capped with a stable GNRA (G = guanine, 
N = any nucleotide, R = purine, A = adenine) tetraloop 
(Figure IB). 

TSMC was used to study the structural features of 
the RNA-HT complex by NMR. A 'H TOCSY experi- 
ment, recorded in ^H20, was used to monitor H6/H5 
cross-peaks for the pyrimidine residues (exclusively cyto- 
sines in the case of TSMC). An overlay of TOCSY spectra 
recorded in the presence of increasing concentrations of 



HT is shown in Figure 4. The H6/H5 correlations for C19 
and C20 experienced neghgible changes in chemical shifts 
compared with the remaining cytosine residues, suggesting 
that the binding event occurred at a position distant from 
the base of the stem. Signal broadening was observed for 
the H6/H5 correlations of CI 5, and to a lesser extent, for 
the joint signal of C16 and C7. Presumably, these residues 
experience exchange-broadening motions when TSMC is 
bound to HT. These chemical shift perturbations (CSPs) 
suggest that the major interaction site between HT and 
TSMC is the RNA's CC mismatch. 

To further investigate the role of the CC mismatch in 
HT binding, an additional RNA construct, termed TSGC 
(Figure IB) was also used. In this construct, C5 was 
replaced with G5, thereby eliminating the mismatch and 
replacing it with a GC base pair. However, assigning this 
construct proved to be very challenging due to the very 
high degree of overlap in the NOESY (Supplementary 
Figure S4D). A few resonances in this experiment, as 
weU in a TOCSY experiment, were assigned tentatively 
based on the comparison of the characteristic chemical 
shifts with the TSMC spectra (Supplementary Figure 
S4). For both TSMC and TSGC, chemical shift changes 
upon HT addition were observed for adenine resonances 
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in 2D NOESY spectra (Supplementary Figure S4C and 
D). In addition, the H6/H5 signals of residue C8 in the 
TOCSY for TSMC (Figure 4) and (based on tentative as- 
signments) in TSGC (Supplementary Figure S4B) experi- 
ence significant chemical shift changes. These changes 
show that HT also interacts with the GNRA tetraloop 
in both TSMC and TSGC. In addition, the TSMC-HT 
interaction was studied using a NOESY experiment 
recorded in H2O (Supplementary Figure S5). Since every 
canonical Watson-Crick base pair (except the one at the 
bottom of the stem) gives rise to a set of cross-peaks in the 
imino-imino and in the imino-amino regions, chemical 
shift changes for these protons can be used to determine 
whether HT binding affects TSMC base pairing. Chemical 
shift changes for the TSMC RNA were monitored at HT/ 
RNA ratios of 0:1, 0.5:1, 1:1, 1.5:1 and revealed most 
significant changes for the G17-C4 base pair. G18-C3 
G6-C15 and G14-C7 also seem to be affected; however, 
a high degree of overlap precludes further evaluation. 
These changes are consistent with our hypothesis that 
HT binds TSMC at two sites: the CC mismatch and the 
GNRA tetraloop. The imino-amino signals from all base 
pairs are conserved at HT/RNA ratios, indicating that the 
HT addition does not affect TSMC base pair integrity. 

Intercalation is the major binding mode of HT to RNA 

We were not able to determine a high-resolution NMR 
structure of the HT-TSMC complex because of a lack of 
intermolecular Nuclear Overhauser Effects (NOEs), pre- 
sumably due to the presence of conformational dynamics 
and multiple interactions of HT with the RNA. The same 
limitation was reported by Tavares et al. for the TSMC- 
paromomycin complex (6). We therefore used UV-Vis and 
fluorescence methods to further characterize the inter- 
action of HT with the TS mRNA. HT is fluorescent and 
thus UV-Vis and fluorescence spectroscopy can report on 
changes in the environment of HT upon RNA binding, 
which in turn provides clues about the binding mode. Two 
binding modes have been characterized in the hterature 
for the interaction of HT with DNA — groove binding at 
AT-DNA and partial intercalation at GC-DNA (8-10). 
Therefore, we first characterized the interaction of HT 
with an AT-rich DNA sample and a GC-rich DNA 
sample to obtain reference spectroscopic and 
photophysical datasets for the two binding modes 
(Supplementary Figures S6 and S7, respectively). Indeed, 
titration of HT with AT- and GC-DNAs caused different 
evolutions of the UV-VIS absorption and steady-state and 
time-resolved fluorescence observables. The main 
properties of the HT-DNA complexes obtained at large 
DNA/HT ratios are reported in Table 1. They are consist- 
ent with the two mentioned binding modes, as discussed in 
detail in the Supplementary data. 

Binding mode to RNA 

In addition to TSMC and TSGC, the RNA construct TSl 
(4) was used for these experiments. TSl, has the native Site 
I sequence and structure stabilized by two additional GC 
base pairs at the base of the stem and was used as a closer 
representative of the TS mRNA. Titration of HT with the 



Table 1. Spectroscopic (absorption, emission and excitation maximum 
wavelengtlis, X"; relative maximum extinction coefficients, e™) and 
photopliysical features (relative emission band areas, A^; and lifetimes, 
t) of HT and HT/NA complexes at large NA/HT mole ratios 



Nucleic acid 


(nm) 


pin 

^HT 

pin 

''HT/NA 


(nm) 


^pT/NA 
^HT 


(nm) 


T(ns) 


None 


340 


1 


510 


1 


349 


0.3 (±0.15) 


AT-DNA 


352 


~1 


445 


-60 


354 


2.8 


GC-DNA" 


342 


0.8 


473 


3.8 


361 


4.1 


TSMC-RNA'' 


346 


1.2 


483 


7.7 


367 


4.2 


TSGC-RNA' 


342 


-1.8 


480 


7.6 


360 


4.3 


TSl -RNA'' 


355 


1.4 


485 


11.1 


372 


4.1 



"Values at GC-DNA/HT = 30, evolution not complete. 
•"Values at TSMC-RNA/H = 30, evolution not complete. 
•"Values at TSGC-RNA/H = 30, evolution not complete. 
''Values at TSl-RNA/HT = 21, evolution not complete. 



three RNAs (TSMC, TSGC and TSl) yielded complexes 
whose spectroscopic and photophysical properties 
were similar to each other and to those observed upon 
HT complexation with GC-DNA (Table 1; Figure 5; 
Supplementary Figures S7-S9). Addition of the RNAs 
caused a marked emission enhancement (Figure 5A; 
Supplementary Figures SB and S9). Like with both 
HT-DNA complexes, the observation of single-exponential 
decays and of the independence from the emission and 
excitation spectra on the excitation and emission wave- 
lengths, respectively, indicated that single emitting 
HT-RNA complexes were formed. As observed with 
GC-DNA and, to a smaller extent, HT alone, the excitation 
spectra of the emitting species were quite strongly 
red-shifted (from +17 to +21 nm) relative to the absorption 
spectra obtained at the highest RNA/HT mole ratios 
achieved (Table 1; Figure 5B); however, titration ends 
were not reached (Figure 5A; Supplementary Figures S8 
and S9). This implies that large amounts of non- 
fluorescent, hkely aggregated species were stiU present 
at these RNA/HT ratios, and that these species did not 
transfer excitation energy to the emitting complexes, i.e. 
the aggregated and the emitting forms of HT did not 
coexist within the same HT-RNA molecular complex. 

The emission enhancement rates, i.e. the increase in 
the emission band areas with increasing nucleic acid 
(NA)/HT ratios, were larger for the three RNA-HT com- 
plexes than for the GC-DNA-HT complex, and all of 
them were much lower than that for the AT-DNA-HT 
minor groove complex (see the ^ht/na/^ht ratios in 
Table 1). In addition, the area plots in the insets (Figure 
5A; Supplementary Figures S8 and S9) showed some 
downward curvatures for the three RNAs, but not for 
GC-DNA (Supplementary Figure S7) at similar NA/HT 
ratios. This indicates shghtly higher stabilities of the fluor- 
escent complexes obtained with the RNAs than with 
GC-DNA. 

The most striking resemblance between the properties 
of the complexes with GC-DNA and the three RNAs 
concerns the fluorescence Hfetimes, all between 4.1 and 
4.3 ns (Table 1). These relatively long lifetimes indicate a 
loss of torsional freedom of HT in the complexes, 
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Figure 5. (A) Absorption (left) and fluorescence emission (right) evolutions of HT titrated witli TSMC-RNA. Insets: absorbance at 340 nm (left) and 
relative emission band (right) areas (both corrected for dilution) as functions of the TSMC-RNA/HT mole ratio. Excitation wavelength: 330 nm (see 
Supplementary data for details). (B) Absorption (solid lines) and fluorescence excitation (dashed lines) spectra of HT alone and titrated with 
AT-DNA, GC-DNA, TSMC-RNA, TSGC-RNA and TSl-RNA at the largest NA/HT mole ratios attained (see text and Table 1). Emission 
wavelengths are 490 nm for HT alone, 450 nm for HT/AT-DNA and 480 nm for the other complexes. 



consistent, as for GC-DNA, with an intercalative binding. 
On tlie otlier hand, the fluorescence excitation and, to a 
lesser extent, the absorption and einission spectra revealed 
some differences between the emitting complexes obtained 
with the three RNAs. Both the absorption and the excita- 
tion band maxima exhibited a progressive blue shift on 
moving from TSl (355 and 372 nm) to TSMC (346 and 
367 nm) to TSGC (342 and 360 nm). The emission maxima 
exhibited a similar, though less pronounced blue shift: 
485, 483 and 480 nm. Since absorption band shifts and 
intensity changes are connected with the nature and 
geometries of the possibly many (28), aggregated forms 
that prevail at low NA/H ratios, and with the details of 
the exciton interactions between the chromophores in 
these aggregates, an interpretation of this property seems 
iinpossible, and of liinited interest, at this stage. On the 
other hand, the positions of the excitation spectra are 
specific features of the single einitting complex which 
prevails at high RNA/HT ratios. Different shifts in these 
spectra can be interpreted both in terins of differences in 
absorption solvatochroinism and of different degrees of 
coplanarity of the two benzimidazole subunits of HT in 
the intercalation complexes. Under the assumption of a 
reduced accessibility of water to the complexed dye as 
compared with free HT, negative solvatochromism of 



HT has often been invoked, to account for the absorption 
red shift (+12 nm; Table 1) of minor-groove complexes of 
HT with AT-DNAs. However, intercalation might also 
induce a distortion of HT with a change in geometry, par- 
ticularly concerning the intramolecular torsional coordin- 
ate, thereby affecting the absorption and emission spectra. 
In this respect, the observed blue shifts in the excitation 
spectra would indicate increasing exposure to water and/ 
or increasing distortion of intercalated HT in the 
TSl-TSMC-TSGC RNA series. 

A possible mechanism for HT-TSMC intercalation 

From UV-Vis and fluorescence spectroscopy, it is clear 
that HT binds the RNAs studied in more than one way, 
but it is also clear that there is a preference for intercal- 
ation of HT at low HT/RNA ratios. In HT, the chromo- 
phoric bis-benzimidazole fragment (Figure IC, rings R2 
and R3) is flanked by rings Rl and R4 on either side, 
making it nearly impossible for the chromophoric 
fragment to intercalate without piercing through the 
RNA helix. Since there are no available experimentally 
determined structures showing such a binding mode, we 
performed molecular docking and MDs simulations to in- 
vestigate the mechanism by which HT might intercalate. 
Docking of HT to the CC mismatch region of TSMC was 
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performed as described in the 'Methods' section. Two 
poses, A and B (Supplementary Figure SI), were 
generated by doclcing in which HT lay in the major 
groove in opposite directions (related by 180° rotation). 
In the A pose, ring R4 was pointed toward CI 6 of TSMC, 
and Rl was oriented toward C5. In the B pose, ring Rl 
was oriented toward CI 6, while R4 was toward C5. These 
two docking poses were used as starting structures for 
independent MDs simulations. 

During the first step of the two step simulation protocol 
employed, the RNA was restrained while HT was free to 
move (see 'Methods' section for details). In the simulation 
starting with pose A, HT remained in the major groove 
in contact with the RNA throughout the simulation and 
the interactions between TSMC and HT were optimized. 
In the simulation starting with HT docked in pose B, HT 
was released into solvent during the tirst nanosecond of 
the simulation (see Supplementary Data Discussion). 
Therefore, pose B was discarded, and the final structure 
from the simulation with pose A, Al, was used as the 
starting structure in simulation A2, in which the CC 
mismatch, one flanking base pair on either side of the 
mismatch and HT were free while the rest of the RNA 
was restrained (see 'Methods' section for details). To our 
surprise, during simulation A2, HT indeed penetrated 
through the helix, thus adopting an intercalative binding 
mode. An interesting chain of events (Figure 6) led to this 
induced fit intercalative binding mode (Figure 2B). HT slid 



over the furrow in the surface formed by the CC mismatch 
such that HT-H6 of the ring Rl could make a H-bond with 
the backbone C5-02P. While the ring Rl was so anchored, 
a rotation, driven by the possibility of H-bonding between 
HT-H3 and TSMC-G 17-06, occurred at the y bond 
(Figure IB and C) between 7 and 9ns. This was followed 
by the formation of a H-bond between HT-Hl and the 
backbone TSMC-G 17-02P at 9.2 ns. Simultaneously, the 
TSMC-C16 base began to flip out to stack against the ring 
R2 of HT. Stacking against C16 and H-bonding between 
HT-N3 and TSMC-G 17-N7 were the key interactions that 
permitted HT to reorient and penetrate through the helix, 
ring Rl first, at 11.2 ns. 

It is imperative to mention that the simulation A2 was 
run for only 24.6 ns, implying insufficient sampling to es- 
tablish the binding mode definitively. At the same time, 
it is difficult to sample completely enough to establish the 
binding mode. Therefore, the induced-fit mechanism 
of intercalation discussed here may not be the main 
method of interaction, rather, it represents one possible 
interaction mode. 

Validation of the intercalation binding model 

Structural stability induced upon intercalation 

HT began intercalating into the CC mismatch only after 
~9ns of the A2 simulation. In the preceding period, the 
interactions of HT in the major groove and its ahgnment 
with the mismatch were adjusted to allow for the 
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Figure 6. Top: RMSD versus time plot for TSMC residues and HT during simulation A2. C5 and C16 are the residues in the mismatch, C4, G6, 
C15 and G17 are the residues flanking the mismatch in the stem; all these were unrestrained during the simulation. Restrained residues refer to the 
remaining residues of TSMC. Bottom: Snapshots of the HT-TSMC complex at various timepoints during the simulation. TSMC is shown in cartoon, 
whereas HT is shown in stick representation. The residues are colored by same scheme as in the RMSD plot. 
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subsequent intercalation. It can be considered that the 
period up to ~9ns represents the interactions of HT 
with the major groove, without any intercalative compo- 
nent. Fluctuating root mean square deviation (RMSD) 
values for HT (Figure 6) during this time suggest that 
binding in the major groove did not lead to a very stable 
complex. High fluctuations in the RMSD of C5, CI 6 and 
HT during the 9-15 ns period represent the intercalating 
event. Thereafter, the RMSD values stabilize indicating 
formation of a structurally stable intercalated complex. 
At ~2 1.5 ns, the intercalated HT slid further into the 
binding pocket formed to optimize the stacking against 
bases C4, C5, G6, C16 and G17. The phenolic hydroxyl 
group on HT shifted from H-bonding to C3-02P to 
C4-02P. C5 also slightly readjusted for better stacking. 

The reduced mobihty of HT upon intercalation as 
compared with groove binding was also evident from the 
change in atomic RMS fluctuations (RMSF) for HT, 
which reduced from 1 63 A during the first 9 ns of the simu- 
lation to 16 A toward the end and after discounting the 
sliding event at 21.5 ns. Similarly to HT, the CC mismatch 
of the RN A also experienced improved structural stabihty 
upon intercalation (Supplementary Figure SIO). In free 
(unliganded) TSMC, the mismatch residues C5 and C16 
had high RMSF of 42 and 38 A, respectively. During the 
first 9 ns of simulation A2, i.e. before initiation of HT 
intercalation, the RMSF of C5 had decreased to 22 A, 
whereas C16 was almost unaltered at 41 A. After intercal- 
ation, i.e. 1 5-24.6 ns, the RMSF of C5 increased slightly 
to 26 A; however, RMSF of C16 reduced significantly to 
19 A, implying that C16 was additionally stabilized by 
intercalation of HT as compared with earher when HT 
was interacting with the major groove. Irrespective of 
the presence or the binding mode of HT, the flanking 
base pairs above and below the CC mismatch i.e. C4, 
G6, CI 5 and G17, which were also aUowed unrestrained 
movement in the A2 simulation, continued fluctuating to a 
similar extent as in the free TSMC. 

Intercalation is the energetically favorable binding mode 

Since it was of interest to deduce the effect of intercalation 
on the binding free energy of the complex, the trajectory 
A2 was split into segments of 5 ns each; the first two 
segments representing the major groove binding mode, 
the last two segments representing intercalation. When 
the HT interactions were limited to the major groove, 
e.g. during the first 5 ns, the computed binding free 
energy was only — 13.7kcal/mol, whereas upon intercal- 
ation i.e. during the last two segments at 15-20ns and 
20-24.6 ns, it improved significantly to ~— 30kcal/mol 
(Table 2). Therefore, compared with binding in the 
major groove, the intercalative mode is energetically 
more favorable. 

Solvent exposure and structural rigidity of 
HT his-henzimidazole subunit 

The solvent exposed surface area (SASA) and torsional 
rigidity of the HT bis-benzimidazole fragment, i.e. rings 
R2 and R3 (Figure IC), are responsible for the measured 
absorption and fluorescence properties of HT. An analysis 
of these structural properties was performed to examine 



Table 2. Computed binding free energy of the HT-TSMC complex, 
calculated with a continuum solvent model (PBSA) for consecutive 
segments of the simulated trajectory 



Time 




Ai^Elcctrostatic 




AGsA 


Binding 


frame 


(kcal/mol) 


(kcal/mol) 


(kcal/mol) 


(kcal/mol) 


free energy 


(ns) 










(kcal/mol) 


0-5 


-21.3 ± 4.6 


-530.0 ± 10.4 


539.5 ± 11.3 


-2.0 ± 0.5 


-13.7 ± 3.2 


5-10 


-25.1 ± 3.2 


-545.1 ± 14.6 


552.4 ± 13.5 


-2.1 ± 0.2 


-19.9 ± 5.2 


10-15 


-33.6 ± 9.7 


-467.5 ± 63.1 


480.7 ± 56.0 


-2.2 ± 0.6 


-22.6 ± 3.8 


15-20 


-45.4 ± 2.6 


-446.6 ± 8.1 


465.3 ± 7.4 


-3.3 ± 0.1 


-30.0 ± 2.9 


20-24.6 


-44.9 ± 2.7 


-425.1 ± 19.3 


442.2 ± 19.2 


-3.1 ± 0.2 


-30.9 ± 2.9 



the consistency of the simulated model with the experi- 
mental data. In free HT, the average computed SASA 
of the chromophoric fragment was 280 A^, while a reduc- 
tion in SASA was observed upon binding TSMC 
(Supplementary Figure SI 1). For the first 1 1 ns of simula- 
tion A2, i.e. during binding in the major groove and a part 
of the transition toward intercalation, the SASA of the 
bis-benzimidazole fragment could be broadly divided 
into two clusters, at 160 and 260 A^; but by the time of 
stabilization of intercalation at 15 ns, the SASA had 
decreased significantly to 65 A^. 

During the simulation of free HT, the torsion angle P 
between the rings R2 and R3 fluctuates around two 
values: 1°±19° and 181° ±17° (Supplementary Figure 
SI 2), indicating free rotation around the bond between 
the two favored planar geometries. However, after 
binding the RNA, such rotation was inhibited and the 
torsion angle p fluctuated at —5° ±17° for the complete 
run. Moreover, the magnitude of the fluctuation of the p 
torsion angle was shghtly reduced upon intercalation, 
—7° ±13° during 1 5-24.6 ns, compared with groove 
binding, —4° ±19° during 0-9 ns, although the effect 
was not as prominent as for the SASA. These data, espe- 
cially for the intercalation binding mode, are consistent 
with the reduced solvent exposure and increased rigidity 
of HT observed in UV-visible and fluorescence 
experiments. 

Based on the analysis of the MD data, the decreasing 
magnitude of changes in the fluorescence properties of HT 
when bound to TSl > TSMC > TSGC can also be 
rationalized. Each helical turn of an A-form RNA helix 
consists of 10-11 base pairs. Therefore, the TSl stem, 
which consists of 12 base pairs, incorporates a complete 
turn, whereas TSMC, with a stem of 8 base pairs, does 
not. This impUes a deeper major groove for TSl, capable 
of shielding HT from solvent better than TSMC, in both 
groove and intercalative binding modes. Due to the 
absence of the CC mismatch, intercalation would be less 
feasible in TSGC and groove binding would be expected 
to predominate. Therefore, the SASA of the HT 
choromophore, though less than in free HT, would not 
reduce as much as in TSl or TSMC where intercalation 
is more feasible. 

In the structural model presented here (Figure 2B), the 
methylpiperazine moiety (ring Rl) of HT protrudes into 
the minor groove, whereas the phenolic moiety (ring R4) 
points toward the major groove of the RNA. It would be 
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logical to expect that a model with the reverse orienta- 
tion, i.e. ring R4 protruding into the minor groove 
while the ring Rl extended toward the major groove 
would be an equally feasible option. In such a structure, 
HT-H6 could form a hydrogen bond with the backbone 
phosphate groups of C3 or C4 and ring R4 could shield 
the C4 of TSMC from solvent. Although such a model 
could be energetically favorable, our NMR data argue 
against this mode of binding. Depending on the rate of 
exchange between the free and bound forms, the 'H 
NMR signals of HT would shift or broaden out. This 
effect would be more pronounced for the protons making 
direct contacts with the RNA. In the NMR spectrum, the 
protons from aromatic rings R2, R3 and R4 of HT 
resonate between 6.5 and 7.5 ppm, whereas the aliphatic 
protons for the methyl group attached to Rl appear at 
2.8 ppm. During titrations of HT into RNA, the signals 
for the methyl protons in Rl built up with increasing 
concentrations of HT whereas the signals of the 
aromatic protons remain broadened beyond detection 
(Supplementary Figure S13A). Similarly, in the TOCSY 
spectra recorded for free HT in ^H20 and subsequent 
titrations of TSMC into HT, the signals for the 
aromatic rings of HT were bleached upon the first 
instance of RNA addition, with an HT:RNA ratio of 
1:0.25, whereas the signals from Rl were neither shifted 
nor significantly broadened (Supplementary Figure 
S13B). These observations suggest that ring Rl of HT 
is relatively free and not involved in interactions with 
TSMC. Therefore, the orientation of the intercalated 
HT should be as proposed by our model and not the 
one flipped by 180°. 

Summary and conclusions 

Development of resistance due to overexpression of TS 
protein is a problem associated with most of the drugs 
inhibiting the TS protein. Targeting the TS mRNA could 
provide a means to tackle this problem. The biological 
assays characterizing TS protein and TS mRNA levels in 
cells treated by HT showed that HT treatment resulted in 
a reduction in TS protein levels without altering the TS 
mRNA levels, indicating that HT modulates TS expres- 
sion by interfering with TS translation. Therefore, 
although HT itself is not a good drug candidate, its bio- 
logical activity suggests that it could be used as a lead 
molecule to design drugs targeting the TS mRNA. Since 
structural information on the TS mRNA-HT complex 
could tremendously aid these drug design efforts, we 
obtained insights into the complex formation using the 
TSMC RNA construct as a model for the Site I of TS 
mRNA. Although the existence of more than one binding 
mode was observed for the TSMC-HT complex, NMR, 
UV-Vis and fluorescence spectroscopy based evidence 
support intercalation as the dominant binding mode at 
low TSMC/HT ratios. The structural model developed 
using docking and MDs simulations confirmed that inter- 
calation at the CC mismatch in the RNA is sterically 
feasible and energetically favorable compared with the 
alternative possibihty of groove binding. 



SUPPLEMENTARY DATA 

Supplementary Data are available at NAR Online: 
Supplementary Figures 1-13, Supplementary Discussion 
and Supplementary References [29-31]. 
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